A general model for the early recognition and colocalization of homologous
DNA sequences is proposed. We show, on thermodynamic grounds, how the
distance between two homologous DNA sequences is spontaneously regulated by 
the concentration and affinity of diffusible mediators binding them, which act 
as a switch between two phases corresponding to independence or colocalization 
of pairing regions. 

Chromosome recognition and pairing is a general feature of nuclear organi- 
zation. In particular, these phenomena have a prominent role (and are compar- 
atively better studied) in meiosis, the specialized cell division necessary for the 
production of haploid gametes from diploid nuclei. During the prophase of the 
first meiotic division, homologous chromosomes identify each other and pair via 
a still mysterious long-distance reciprocal recognition process [1, 2, 3].

Many hypotheses exist on the mechanisms underlying the early stages of 
coalignment of homologs along their length (see refs. in [1, 5, 3]). A
long-standing idea is that pairing may occur via unstable interactions, such as
a direct physical contact between DNA duplexes (the "kissing model"; see, e.g.,
[4]). Pairing initially based on non-permanent interactions has the important
advantage of preventing ectopic association between non-homologous
chromosomes and of avoiding topologically unacceptable entanglements, leaving
room for adjustments [4]. Several mechanisms could contribute to the outcome of
the process, e.g., constrained motion of chromosomes in territories, bouquet
formation at telomeres, and tethering to the nuclear envelope. While full
chromosome alignment includes several stages, the early physical contact and colocalization
could be driven by specific chromosomal regions bridged by molecular media- 
tors. In this complex scenario, though, the crucial question on the mechanical 
origin of early recognition and pairing remains unexplained. 

Here we explore the thermodynamic properties of a recognition/pairing mech- 
anism based on weak, biochemically unstable interactions between specific DNA 
sequences and molecular mediators binding them. We show that randomly
diffusing molecules can produce a long-distance interaction mechanism whereby
homologous sequences spontaneously recognize and become tethered to each other.
This colocalization mechanism is tunable by two "thermodynamic switches",
namely the concentration of molecular mediators and their affinity for their
binding sites. When threshold values in the concentration, or affinity, of
mediators are exceeded, homologous sequences are joined together; otherwise
they move independently.

Model: Our model includes (see Fig. 1) two homologous segments involved
in mutual recognition and pairing, described as self-avoiding bead chains, a
well-established model of polymer physics [5], and a concentration, $c$, of
Brownian molecular factors having a chemical affinity, $E_X$, for them. We
investigate the thermodynamic properties of the system by Monte Carlo (MC)
computer simulations [5]. For computational purposes, chromosomal segments and
molecules are placed in a volume consisting of a cubic lattice with spacing
$d_0$ (our space unit, of the order of the length of the molecular factors) and
linear sizes $L_x = 2L$, $L_y = L$ and $L_z = L$ (see Fig. 1). In each
simulation, the 'beads' of the chromosomal segments start from a straight,
vertical line configuration, at a distance $L$ from each other, and molecular
mediators from a random initial distribution. Diffusing molecules randomly move
from one vertex to a nearest-neighbor vertex on the lattice. No more than one
particle can be present on a vertex at a given time. The chromosomal segments
diffuse as well on the lattice, performing a Brownian motion under the
constraint that two proximal 'beads' on the string must be within a distance
$\sqrt{3}\,d_0$ from each other (i.e., on nearest or next-nearest neighboring
sites on the lattice). For the sake of simplicity, we disregard here the rest
of the chromosomes, and the ends of the DNA segments are constrained to move
tethered to the bottom and top planes of the system volume (Fig. 1). When
neighboring a chromosomal chain, molecules interact with it via a binding
energy $E_X$. Below, we mainly discuss the case where $E_X$ is of the order of
a "weak" hydrogen-bond-like energy, say 3 kJ/mole, which at room temperature
corresponds to $E_X = 1.2\,kT$ [7]. In our simulations, at each time unit
(corresponding to a MC lattice sweep) the probability that a particle moves to
a neighboring empty site is proportional to the Arrhenius factor
$r_0 \exp(-\Delta E / kT)$, where $\Delta E$ is the energy barrier in the move,
$k$ the Boltzmann constant and $T$ the temperature [3, 8]. The factor $r_0$ is
the reaction kinetic rate, which depends on the nature of the molecular factors
and of the surrounding viscous fluid, and sets the time scale. We employ
$r_0 = 30\,\mathrm{s}^{-1}$, a typical value in biochemical kinetics. Averages
are over up to 2048 runs from different initial configurations.
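
The ingredients of the move rule can be illustrated with a minimal Python
sketch. It is not the simulation code used for the results reported here: the
function names are hypothetical, the temperature of 300 K is an assumed value
for "room temperature", and capping the acceptance probability at 1 for
barrier-free moves (with one MC sweep identified with a time $1/r_0$) is one
common convention, assumed here for illustration.

```python
import math

# Gas constant in kJ/(mol K): the Boltzmann constant expressed per mole.
R_KJ_PER_MOL_K = 0.0083144626


def energy_in_kT(energy_kj_per_mol, temperature_k=300.0):
    """Convert a molar energy (kJ/mole) to units of kT (300 K is an assumption)."""
    return energy_kj_per_mol / (R_KJ_PER_MOL_K * temperature_k)


def acceptance_probability(barrier_kT):
    """Arrhenius factor exp(-dE/kT), capped at 1 when there is no barrier."""
    if barrier_kT <= 0.0:
        return 1.0
    return math.exp(-barrier_kT)


def sweep_duration_seconds(r0_per_second=30.0):
    """Physical time assigned to one MC sweep, assuming it equals 1/r0."""
    return 1.0 / r0_per_second


def bond_ok(bead_a, bead_b):
    """Check that two proximal beads lie within sqrt(3) lattice spacings.

    Coordinates are integer lattice positions in units of d0, so the test is
    done on the squared distance to avoid floating-point square roots.
    """
    squared = sum((p - q) ** 2 for p, q in zip(bead_a, bead_b))
    return squared <= 3


if __name__ == "__main__":
    e_x = energy_in_kT(3.0)  # the "weak" bond energy quoted in the text
    print(f"E_X = {e_x:.2f} kT")
    print(f"escape probability per sweep = {acceptance_probability(e_x):.3f}")
```

With these assumptions, a molecule bound with $E_X \simeq 1.2\,kT$ escapes in
roughly one attempt out of three, which illustrates why a single bond is
biochemically unstable.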

Results: First we show how the interaction of chromosomes with molecular 
mediators drives colocalization. To this aim, we calculated the thermodynamic 
equilibrium value of the average square distance (relative to the system linear 
size $L$) between the two chromosomal segments:

$$ d^2 = \frac{1}{N L^2} \sum_{z=1}^{N} \langle r^2(z) \rangle $$

where $N$ is the number of beads in each string (here $N = L$) and
$\langle r^2(z) \rangle$ is the average (over MC simulations) of the square
distance of the beads at 'height' $z$. The average value of $d^2$ is maximal
when the two 'chromosomes' float independently and decreases if parts of the
polymers become colocalized, approaching zero when a perfect alignment is
attained.
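
As an illustration of how this order parameter can be evaluated from simulation
output, the following Python sketch averages the squared bead-to-bead distance
over runs and over heights. The data layout (a list of chain pairs, with bead
index $i$ of each chain taken as the bead at 'height' $z = i$) and the function
name are assumptions made for the example.

```python
def normalized_square_distance(runs, box_size):
    """Return d^2 = (1 / (N L^2)) * sum_z <r^2(z)>.

    runs:     list of (chain_a, chain_b) pairs, one pair per MC run; each chain
              is a list of (x, y, z) bead coordinates in lattice units, and
              bead i of both chains is taken as the bead at height index i.
    box_size: the linear size L of the system, in lattice units.
    """
    n_beads = len(runs[0][0])
    total = 0.0
    for z in range(n_beads):
        # <r^2(z)>: average over runs of the squared distance between the
        # two beads found at height index z.
        r2_sum = 0.0
        for chain_a, chain_b in runs:
            r2_sum += sum((p - q) ** 2 for p, q in zip(chain_a[z], chain_b[z]))
        total += r2_sum / len(runs)
    return total / (n_beads * box_size ** 2)


if __name__ == "__main__":
    # Toy configurations (not simulation data): two straight chains 5 lattice
    # units apart, and two perfectly aligned chains.
    apart = ([(0, 0, 0), (0, 0, 1)], [(3, 4, 0), (3, 4, 1)])
    aligned = ([(1, 1, 0), (1, 1, 1)], [(1, 1, 0), (1, 1, 1)])
    print(normalized_square_distance([apart], box_size=10))
    print(normalized_square_distance([aligned], box_size=10))
```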

The equilibrium distance, $d^2$, depends on the concentration, $c$, of
mediators. At low concentration (see Fig. 2; e.g., $c < c_1$), $d^2$ has a
value of the order of the system size (around 40% of $L^2$), corresponding to
the expected average distance of two independent strings undergoing Brownian
motion in a box of size $L$; a typical configuration, for $c = 0.3\%$, is shown
in Fig. 1, panel A. Indeed, the physical basis for the independence of
chromosomes exposed to a low concentration of mediating molecules is intuitive:
pairing can occur when bridges are formed by molecules attached to pairs of
binding sites. A single bridging event, however, can be statistically quite
unlikely, since 'weak' bonds are biochemically unstable and, to form a bridge,
a diffusing molecule must first find (and bind) a site on one chromosome and
then, together, they have to successfully encounter the second one.

Fig. 2 shows, however, that when $c$ is higher than a threshold value,
$c_{tr}$ (for $E_X = 1.2\,kT$, $c_{tr} \simeq 0.7\%$), $d^2$ collapses to zero:
this is the sign that the two 'chromosomes' have colocalized; a typical picture
of the system state, for $c = 2.5\%$, is shown in panel C of Fig. 1. When $c$
is high enough, the chances of forming multiple bridges increase and, as the
bridges reinforce each other, configurations where molecules hold the two
polymers together become stabilized. The threshold concentration value,
$c_{tr}$, corresponds to the point where such a positive feedback mechanism
prevails, and can be approximately defined by the inflection point of the curve
$d^2(c)$. As in phase transitions in finite-size systems [HIE] (see below),
around $c_{tr}$ there is a crossover region which can be located, for instance,
between the concentrations $c_1$ and $c_2$ (see Fig. 2) defined by the
criterion that $d^2$ is within 5% of the random or zero plateau value (for
$E_X = 1.2\,kT$, $c_1 \simeq 0.3\%$ and $c_2 \simeq 2\%$).
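
The two criteria above can be applied to a sampled curve $d^2(c)$ along the
lines of the following Python sketch. It is only an illustration: the function
names are hypothetical, the inflection point is estimated crudely as the
midpoint of the steepest finite-difference segment, the "5%" tolerance for the
zero plateau is taken relative to the random plateau value (an assumption), and
the numbers in the usage example are made up, not results of this work.

```python
def crossover_bounds(c_values, d2_values, plateau, tolerance=0.05):
    """Estimate (c1, c2) for a curve d2(c) sampled at increasing c.

    c1: last concentration at which d2 is still within `tolerance` of the
        random plateau value, before the curve first leaves it.
    c2: first concentration at which d2 is within `tolerance * plateau` of 0.
    Either value is None if the curve never meets the criterion.
    """
    c1 = None
    for c, d2 in zip(c_values, d2_values):
        if d2 >= (1.0 - tolerance) * plateau:
            c1 = c
        else:
            break
    c2 = next(
        (c for c, d2 in zip(c_values, d2_values) if d2 <= tolerance * plateau),
        None,
    )
    return c1, c2


def inflection_estimate(c_values, d2_values):
    """Estimate c_tr as the midpoint of the steepest descending segment."""
    slopes = [
        (d2_values[i + 1] - d2_values[i]) / (c_values[i + 1] - c_values[i])
        for i in range(len(c_values) - 1)
    ]
    steepest = min(range(len(slopes)), key=slopes.__getitem__)
    return 0.5 * (c_values[steepest] + c_values[steepest + 1])


if __name__ == "__main__":
    # Made-up curve for illustration only (concentrations in percent).
    c = [0.1, 0.3, 0.5, 0.7, 0.9, 2.0, 2.5]
    d2 = [0.40, 0.39, 0.32, 0.18, 0.10, 0.01, 0.00]
    print(crossover_bounds(c, d2, plateau=0.40))
    print(inflection_estimate(c, d2))
```

A finer concentration grid, or a smooth fit of the curve before
differentiation, would give a less noisy estimate of the inflection point.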

In Fig. 2, along with the distance between chromosomes, $d^2$, we plot the
squared fluctuations of the distance (i.e., its statistical variance),
$\Delta d^2(c)$, as a function of the concentration of mediators. For
$c < c_1$, both $d^2(c)$ and $\Delta d^2(c)$ have the non-zero value found for
non-interacting Brownian strings in the independent diffusion regime
($\Delta d^2 \simeq 30\%$); instead, $\Delta d^2(c) = 0$ for $c > c_2$, in the
tight colocalization regime. Interestingly, in the crossover region, $d^2(c)$
is smaller than in the purely random regime, although it has marked
fluctuations ($\Delta d^2(c)$ can be even larger than $d^2(c)$). This situation
is illustrated by a picture of a typical configuration, for $c = 0.9\%$, shown
in panel B of Fig. 1. In such an intermediate regime chromosome pairs are
continuously formed and disrupted.
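
A rough way to tell the three regimes apart from a set of runs is sketched
below in Python. The precise normalization of $\Delta d^2$ is not spelled out
in the text, so the sketch assumes, purely for illustration, that the
fluctuation is the standard deviation across runs of the normalized squared
distance; this choice has the same units as $d^2$ and can exceed it. The
function name and the sample values are hypothetical.

```python
import statistics


def distance_statistics(r2_samples):
    """Return (mean, spread) of per-run normalized squared distances.

    r2_samples: one value of the normalized squared distance per MC run.
    The spread is the population standard deviation across runs, used here
    as an assumed stand-in for the fluctuation of d^2.
    """
    mean_r2 = statistics.fmean(r2_samples)
    spread = statistics.pstdev(r2_samples)
    return mean_r2, spread


if __name__ == "__main__":
    # Made-up samples: most runs are paired (value 0), a few are unpaired.
    transient = [0.0, 0.0, 0.0, 0.4]
    mean_r2, spread = distance_statistics(transient)
    # When pairs form and break, the spread can exceed the mean.
    print(mean_r2, spread, spread > mean_r2)
```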

Summarizing, our results show that colocalization is spontaneously induced
by the 'collective' binding of molecular mediators and occurs only when $c$ is
above a critical value, $c_{tr}$, i.e., in the 'colocalization phase'.
Conversely, when $c$ is below $c_{tr}$, $d^2(c)$ has the same value found for
two non-interacting Brownian strings. This is the 'random phase', where
chromosomes are independent. The concentration of mediators acts as a switch
between the two phases, while around the critical threshold chromosomes undergo
transient interactions.

A similar effect is found when, for a given (high enough) concentration, $c$,
the chemical affinity, $E_X$, of binding sites is changed (see Fig. 2, lower
panel): when $E_X$ is smaller than a threshold value, $E_{tr}$, the two
polymers float independently of each other. Around $E_{tr}$ a crossover region
is found and, as soon as $E_X$ gets larger than $E_{tr}$, an effective
attraction between polymers is established and they are spontaneously
colocalized. Another potential layer of regulation of the system is the number
of binding sites for molecular mediators. In fact, a reduction in the number of
binding sites produces the same effect as a reduction in the affinity of
mediators, that is, chromosomes become unable to find and bind each other.

The pairing mechanism illustrated above has a thermodynamic origin. It is a
'phase transition' [5] occurring when the entropy loss due to polymer
colocalization is compensated by the energy gained by particles as they bind
both polymers: the lower $E_X$, the higher the concentration, $c$, required.
The transition is found in a broad region of the $(E_X, c)$ plane, as shown in
Fig. 3, where the system phase diagram is plotted in a range of typical
biochemical values of "weak" binding energies $E_X$. For very low values of
$E_X$, instead, colocalization can be impossible. The overall properties of
such a phase diagram (independent vs. colocalized chromosomes) are robust to
changes in the model details, though the precise location of the different
phases can be affected [8]. Summarizing, when soluble mediators bind a specific
recognition sequence on homologous chromosomes, recognition and colocalization
of homologs can occur as a result of a robust and general thermodynamic
phenomenon, namely a phase transition occurring in the system. The higher the
affinity of mediators for chromosomal binding sites, the lower the threshold
concentration of mediators that promotes colocalization (see Fig. 3).
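
A transition line of this kind can be extracted from a grid of equilibrium
values of $d^2$ as in the following Python sketch. It is a schematic
illustration under stated assumptions: the function name is hypothetical, the
threshold is defined here as the lowest sampled concentration at which $d^2$
drops below half of the random plateau value (a simpler proxy than the
inflection point), and the grid in the usage example is made up.

```python
def transition_line(affinities, concentrations, d2_grid, plateau, fraction=0.5):
    """Map each affinity E_X to an estimated threshold concentration.

    affinities:     sampled values of E_X (one per row of d2_grid).
    concentrations: sampled values of c in increasing order (one per column).
    d2_grid:        d2_grid[i][j] is the equilibrium d^2 at affinities[i] and
                    concentrations[j].
    plateau:        value of d^2 for independent strings.
    Returns a dict {E_X: c_tr}; c_tr is None when no sampled concentration
    brings d^2 below fraction * plateau (no colocalization in the range).
    """
    line = {}
    for e_x, row in zip(affinities, d2_grid):
        line[e_x] = next(
            (c for c, d2 in zip(concentrations, row) if d2 < fraction * plateau),
            None,
        )
    return line


if __name__ == "__main__":
    # Made-up grid for illustration: E_X in units of kT, c in percent.
    e_values = [0.5, 1.2, 2.0]
    c_values = [0.1, 0.7, 2.5]
    grid = [
        [0.40, 0.40, 0.40],  # very weak binding: never colocalized
        [0.40, 0.15, 0.00],
        [0.10, 0.00, 0.00],  # stronger binding: lower threshold
    ]
    print(transition_line(e_values, c_values, grid, plateau=0.40))
```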

Discussion: We described a general colocalization mechanism, grounded
in thermodynamics, whereby specific regions of a pair of chromosomes can
spontaneously recognize each other and align. Physical juxtaposition is
mediated by sequence-specific molecular factors that bind DNA via weak,
non-permanent biochemical interactions. When the concentration/affinity of
molecular mediators is above a critical threshold, an effective attraction
between their binding regions is generated, leading to a close alignment;
otherwise chromosomes float away from each other by Brownian motion. In the
threshold crossover region, pairing sites undergo transient interactions: the
average distance is shorter than in the purely random regime, but marked
fluctuations are observed.

In our simulations, the two homologous pairing regions are described as
polymers diffusing with their ends tethered to the upper and lower planes of the
system box. This recalls the tethering of telomeres to the nuclear envelope
observed at meiosis. While tethering is not a prerequisite for the switch
mechanism, it can enhance the switch effects [U El 13]. Releasing such a
constraint does not change the general results, but pairing regions would
collapse in a more disordered geometry. The overall properties of the phase
diagram (independent vs. colocalized chromosomes) are robust to changes in the
model details [H]. A model including many pairs of chromosomes has longer
equilibration times, as expected in a crowded environment, yet its phase
diagram is unchanged. The scenario is also unaltered in the case of mediators
that interact with each other and aggregate.

An implication of this model is that a cell can regulate the initiation of
homologous chromosome interaction by up-regulating the concentration of
mediators or their affinity for DNA sites (e.g., through changes in the
chromatin or by a chemical modification of the mediator). This switch has
general and robust roots in a thermodynamic phase transition [5], irrespective
of its ultimate molecular and biochemical basis. In real cells, specific short
chromosomal regions ("pairing centers") could mediate the early steps of
homolog recognition, and act as a seed and reference point for a subsequent
stable, long-scale chromosomal pairing, which could involve additional
mechanisms. A speculation is that the threshold effect can be exploited to
ensure a precise control of pairing formation/release, while the presence of a
crossover region in concentration can be exploited to reduce undesired
entanglements. The initial binding molecules could, in turn, help the sequences
in recruiting complexes later used for other purposes (e.g., in pairing
stabilization, synapsis, recombination).

In the present model individual mediators do not need to bind strongly
to glue homologous chromosomes together, and any molecule with
above-threshold affinity can induce attraction. Specificity of colocalization
among many chromosome pairs could, indeed, be obtained by sets of molecules
binding, with higher affinities, specific homologous sequences. While the
molecular mediators considered here are supposed to have more than one "DNA
binding domain", proteins that can bind a single DNA site, but are able to make
protein-protein interactions, could also mediate colocalization. As a pair of
linked proteins is, in fact, a single molecular mediator, the thermodynamic
picture is unchanged. Finally, direct DNA duplex interactions [3] could
replace, or help, binding molecules. A duplex kissing site would correspond in
our model to a binding site with a molecular mediator already attached, so the
overall behavior should be similar.

Experimental research on meiotic pairing has made great progress, but the
mechanisms for early coalignment of homologs are still unclear [HOE]. In
C. elegans, for instance, the proper pairing of homologs is primarily regulated
by special telomeric regions, known as "pairing centers" (PCs) [10, 11, 12].
Homologous PCs interact, during early prophase, with HIM/ZIM Zn-finger
proteins, which are necessary to mediate pairing [15, 13]. Specific sites and
proteins are also involved in meiotic pairing in Drosophila. In males, on the X
and Y chromosomes, a 240 bp repeated sequence in the intergenic spacer of rDNA
acts as a pairing center, and autosomes pair, as well, by the interaction of a
number of sites (see refs. in [2, 3]). A similar behavior is observed in
Drosophila females [13, 15]. In Drosophila males, special proteins, SNM and
MNM, have also been discovered which bind X-Y and autosomal pairing sites at
prophase I, and are required for pairing [17]. Whether the present model
applies to such an experimental scenario is an open question. In a picture
where pairing is mediated by unstable interactions, thermodynamics dictates,
anyway, a precise framework showing that minimal "ingredients", such as soluble
DNA binding molecules and homologous arrays of binding sites, can in fact be
sufficient for pairing if the balance of mediator concentration and DNA
affinity is appropriate.

Our thermodynamic switch theory is amenable to experimental tests (e.g., of
the existence of threshold effects in mediator concentration, $c$). It can be
exploited, as well, for a quantitative understanding of the effects on pairing
of, e.g., deletions (which can be modeled here by reducing the number of
binding sites within $L$) or chemical modifications of binding sequences
(modeled by changes in $E_X$), and to guide the search for candidate
chromosomal sites and interaction mediators. Finally, the general message of
the model may be applicable to various cellular processes that involve the
spatial reorganization of DNA in nuclear space (e.g., organization of
chromosomal loci and territories, juxtaposition of DNA sequences in
transcriptional regulation, somatic pairing, pairing of X chromosomes at the
onset of X inactivation [1, 2, 5, 15, 10, 20, 21, 22, 23]).

We thank N. Kleckner and A. Storlazzi for very helpful discussions and 
critical reading of the manuscript. 
[Figure 1 panels: A) c = 0.3%; B) c = 0.9%; C) c = 2.5%]

Figure 1: Pictures of typical configurations, from computer simulations, of the
model system at thermodynamic equilibrium, in the two phases discussed in
Fig. 2 (panel A, independent motion; panel C, colocalization) and in their
intermediate crossover region (panel B), for the shown values of the
concentration of molecular mediators, $c$ (here $E_X = 1.2\,kT$).
Figure 2: Top panel: The equilibrium chromosome average square distance,
$d^2$, is shown as a function of the concentration of binding molecules, $c$
(here the molecule/chromosome affinity is $E_X = 1.2\,kT$): for
$c < c_{tr} \simeq 0.7\%$, $d^2$ approaches values as big as the system size
and chromosomes are randomly and independently diffusing (horizontal dotted
lines give the values found for pure random walks); for $c > c_{tr}$, $d^2$
rapidly decays to zero, showing that they have colocalized. Around $c_{tr}$
there is a crossover regime, approximately between $c_1$ and $c_2$, where
chromosomes tend to align, since $d^2$ is smaller than in the region where they
move independently, but its fluctuations, $\Delta d^2$, are of the order of
$d^2$; here chromosomes are only transiently colocalizing. Bottom panel: A
similar behaviour is found when $d^2$ is plotted as a function of the chemical
affinity, $E_X$, shown here for $c = 0.1\%$.
Figure 3: This phase diagram shows the state of the two chromosomes at
thermodynamic equilibrium in a range of values of chemical affinity and
concentration of their molecular mediators, i.e., in the $(E_X, c)$ plane. For
small $E_X$ and $c$, chromosomes move independently while, above a transition
region, they spontaneously colocalize. The transition line, $c_{tr}(E_X)$, is
marked by the heavy black line. Colocalization, thus, can be spontaneously
attained by upregulation of mediator concentration, $c$, or of the chemical
affinity, $E_X$, of the molecules for chromosomal sequences.